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Abstract 

Due to the contradiction between energy production and demand in the current energy crisis, power 
consumption (PC) is crucial to the world economy. Energy industry can improve power system control and 
energy usage by using machine learning (ML) models, which are widely acknowledged as a precise and 
computationally efficient prediction solution. For the purpose of projecting energy consumption and 
performance, machine learning (ML) techniques and artificial intelligence (AI) have been suggested recently. 
The various machine learning (ML) techniques i.e. artificial neural networks (ANN), support vector machines 
(SVM), Gaussian-based regressions and Fuzzy logic etc. that have been frequently used in predicting and 


enhancing energy performance are reviewed in-depth in this research. 
Keywords: Artificial Intelligence; Energy Consumptions; Machine Learning 


1. Introduction 

A nation's economic development depends heavily 
on the electric power sector, which also significantly 
improves society through its effective functioning. 
The need for power has _ recently reached 
unfathomable heights as technology has advanced in 
cities all over the world. Global warming issues will 
worsen if national governments on a global scale are 
unable to meet this energy needs. Energy demand in 
India continues to rise at over 3% per year in the 
Stated Policies Scenario from 2021 to 2030. [1] 
According to the International Energy Agency's 2023 
report, the average annual growth rate of total energy 
demand through 2030 is 0.7%. Demand is expected 
to rise until 2050. India is predicted to have the 
fastest-growing energy consumption globally over 
the next thirty years. By 2030, peak electricity 
demand has increased by almost 60% from 2022 [2]. 
Because of the growing population, the comfort 
levels of buildings, and people spending more time 
inside of them, the energy demand will therefore 
climb significantly in the near future. Peak 
consumption can present significant difficulties for 
electricity providers since they have to over- 
dimension the grid to accommodate the unusually 


high load of consumption. In order to prevent serious 
consequences like power outages, energy scarcity 
must be managed by the providers. An alternative 
strategy is to balance the grid by introducing 
intelligent methods for controlling these peaks, which 
will help prevent overreach and lower the costs 
associated with over-dimensioning and enormity. 
There are several ways to control the peaks, including 
load balancing and the development of dynamic and 
intelligent pricing strategies that take into account the 
fact that end users are price sensitive and may cut 
back on consumption when electricity prices are high. 
Therefore, it is necessary to forecast energy 
consumption in order to support national production 
and consumption balance. This review paper focuses 
on the different methods that are employed to forecast 
energy usage. There are three types of load forecasts: 
short-term forecasts, which are typically from one 
hour to one week; medium-term forecasts, which are 
typically from a week to a year; and long-term 
forecasts, which are typically more than a year. For 
the scheduling and control of power systems, short- 
term forecasting is crucial. [3] However, predicting 
daily load is a difficult process since it depends on 
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other factors, such as temperature and the calendar 
effect, in addition to the load from the previous days. 
As a result, anyone looking to save energy should 
focus on improving energy usage and_ the 
effectiveness of energy in buildings. We highlight a 
machine learning-based methodology for predicting 
energy usage in this review. [4] To make the most 
accurate prediction, it is crucial to identify the 
optimal strategy and model. Finding the answers to 
the following questions is the basis of this research: 
1. What are the key elements that have a 
significant impact on how much energy is 
consumed? 
2. How do the daily patterns of electric energy use 
in winter, rainy and summer differ? 
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3. Which machine learning techniques have the 
highest forecast accuracy limitations? 

4. What advantages do various models have for 
accurately estimating how much electricity is 
used? 

Accordingly, Figure 1 depicts the information flow. 
Apart from above information, the rest of the paper is 
divided into the following categories: [7-9] Second 
section discusses the factors affecting for load 
forecasting, and third section the performance 
metrics of the models, last section discusses shows 
the comparative study of the load forecasting 
techniques. Prediction category and duration shown 
in Table 1. 


— 3. Categories of 
sis Electricity Load 
LTF 


4. Factors Affecting for 


Electricity Load 


| MAE | 
5. Comparative Study 


of various models 


Economic 


Figure 1 Flow of Information in the Review Paper for Electricity Load Forecasting 


1.1. Categories of Load 
Forecasting 
There are three categories into which electrical load 
forecasting can be divided, they are Long-term (1—20 
years), Medium Term (a week or month), Short term 
(one week to one hour). [10] The short term 
forecasting (STF) is used to provide the essential data 
for scheduling generator start-up and shutdown, and 
performing a complete examination of gearbox 
constraints. System security is also assessed using the 
STF. In order to economically prepare for future 
additions to the generation system and additional 


transmission planning, long-term forecasting (LTF) \ 


Electricity 


Is essential. The medium-term forecasting (MTF) is 
useful for other elements, primarily for determining 
tariffs, planning maintenance and repairs, scheduling 
fuel supply, and financial management. [5][6]. 


Table 1 Prediction Category and Duration 


Sr. No Prediction Duration 
Category 

1 Long Term months to year 

Z Mid Term weeks to a month 

3 Short Term Hours to week 
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2. Factors Affecting on Load Forecasting 
To get areliable forecast from the system, researchers 
must carefully choose the dependent components. 
Each component has a varied impact on load 
forecasting in varying degrees, and each model may 
take a variety of factors into account, some of which 
are listed below. [11-15] though other major and 
small aspects influence every phase of forecasting, 
time, weather, and economic considerations may be 
thought of as the main influences to take into account. 
Some of the factors are discussed as follows. 
Time Factor: 
In order to anticipate loads, time is the most crucial 
aspect. The load curve contains "time of the day" 
property together with "day of week", "week of 
month", and "month of season" properties, according 
to [16] observations of the load curves of many 
distinct grid stations. 
Weather Factors: 
According to seasonal patterns, winter months 
(October—February) accounted for 20% of overall 
consumption, while the summer months (May-— 
August) contributed 50% to the total. [17] According 
to a study, demand rose 1.5% in tropical Indian towns 
for every 1°C increase in summertime temperatures. 
[18] 
Economic Factor: 
System average load and maximum demand are 
significantly impacted by economic factors including 
power pricing, load § management, and 
industrialization level. [19-23] the accuracy of 
forecasting is also greatly influenced by factors such 
as customer behavior, tariff changes, appliance 
descriptions, population density, and equipment age 
and employment levels. 
2.1. Performance Analysis 
Models used in Prediction 
Each machine learning pipeline _ includes 
performance metrics. As per previous study, various 
distinct metrics are employed to evaluate the models' 
performance, shown in the equations below. 
Mean Absolute Error: 


of Different 


1 7 
MAE = — "— Xx 
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Where: 

n = the number of errors, 

x = summation symbol (which means “add them all 
up”), 

|xi — x| = the absolute errors. 

Mean Absolute Percentage Error: 


Where: 

N is the number of fitted points, 

At is the actual value, 

Ft is the forecast value. 

x is summation notation (the absolute value is 
summed for every forecasted point in time). 

Root Mean Squared Error: 


Where: 
Yi is the actual value for the ith observation. 
Yi is the predicted value for the ith observation. 
N is the number of observations. 
P is the number of parameter estimates, including the 
constant. 

2.2. Comparative Analysis of Error Metrics of 

Different Forecasting Models 

To demonstrate how accurate the load forecasting 
approaches are at predicting the actual load values, a 
number of criteria are employed as evaluation 
indices. [24-27] to measure the precision of their 
models, different researchers have relied on various 
statistical metrics. The most common static 
measurements used by academics around the world 
are shown in Table 2. According to the study, each of 
these measurements has benefits and drawbacks of its 
own. [28-32] mostly used metrics are MAPE, RMSE. 
The second-degree loss function provided by RMSE 
prioritizes larger errors rather than minor ones. It can 
measure average error naturally and is unambiguous. 
MAPE is simply applied to both large volume and 
low volume items and is not scale dependent. Figure 
2 shows Prediction Matrices Studies. 
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Table 2 Performance Matrices and Comparative Study of Various Models 
Model Reference MAPE RMSE MAE MSE R? 
ANN [20] 3.38 2.19 - - 
4.51 
[21] 3.50 2.10 = 2 
[22] - - 1.12 - 
[23] 1.82 z 2 : 0.96 
MLR 
[24] 5.05 “ = 4 0.89 
[25] 51.45 0.8649 0.6278 0.7480 - 
LSTM [26] - 3.337392 - - - 
[27] 6.27 0.0154 0.0107 - - 
[28] 4.60 6.10 0.92 
RF 7 7 
[29] 5.45 3.219 0.576 
[29] 4.913 
- - 0.277 
SVM [30] 1.9705 2 
1.5438 - - 
[31] 1.682 12.364 
[30] 3.6082 = 1.4826 : : 
KNN 
[32] 3.95 - : = - 
33 0.99 - - - - 
a [33] 
[34] 1.94 202.1 - - - 
[29] : 2.914 
XGBoost - - 0.711 
[34] 2.95 368.4(kWh) 
GBDT [35] - 4.748(%) 0.625(KW) - 0.982 
CNN [36] 0.54 59.45 14745.41 


2.3. Models used for Predicting the Electricity 
20 Load Consumption 
Table 3 shows comparative study of machine 


E learning models for Prediction the review's objective 
& . is identifying the most popular algorithms used in 
2. 2 electricity forecasting models. The sixty-one (61) 
wg article contains eleven (11) of the most frequently 


used algorithms. [33-37] the results of the study 
demonstrate how applying AI to different economic 
- sectors can increase productivity and efficiency. 
: According to a study, ANN models are frequently 
employed for prediction, and fuzzy logic is also 
growing in favor these days. Figure 3 shows the 
Comparative Studies. 


MAPE RMSE MAF MSF R2 
Error Metric 


Figure 2 Prediction Matrices Studies 
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Figure 3 Comparative Studies 
Table 3 Comparative Study of Machine Learning Models for Prediction 
Categories Time ‘ Reason of Categories of Performan 
of model Resrence oe factor Dae prediction Features ce Metric 
: 3.72%RMSE 
: Short Term Time, day, oT 
Fuzzy Logic [46] Residential | Hourly 4year Load temperature, season, 2.1 %MAFE 
ANFIS . 1.85%MAPE 
Forecasting daylight 3. 66%RMSE 
Energy | Cindustries, 
ANOVA [21] on Hourl 24 years consumplen roducts, equi ment pee 
ANN Residential | say inindustry |? eee 2.10MAE 
technologies, 
sectors 
processes 
wind direction, sun 
Building radiation, humidity 
ANN [38] Residential Hourly | Smonths | Energy level, and 0.0323(CV) 
Prediction temperature of the 
outside dry bulb 
Accurate Temperatureandhumi 
a [41] ee Hourly | 8years peak dity,windspeed,cloud N/A 
= forecasting [cover daylight Hours 
Sunlight sun radiation, 
exposure and cecutameans 
ANN [38] Residential Hourly lyear P temperature, wind 25.50%WMAE 
aad direction, and 
of the load Sette 
sampling time stamp 
The building's size, 
: occupant level, 0.29MAPE 
CNN [45] Residential Hourl 2years Paienare RICCING AEDUBNOGS: 1.32MAE 
SVM =| ae ada humidity and 2.99MSE 
gs temperature, and 0.23RMSE 
calendar element 
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Most The distribution of 
FF. dominant the lagging electrical 1.42%MAPE 
DNNR- [39] HeGuonial Hourly | 4years factors for load Across time, 306(MW/h)RMS 
~DNN power weather, time, and E 
consumption | holidays, and the 210(MW/h)MAE 
prediction price of electricity 
ANN [40] Non Hourly | 4years {Management temperature 44,111 IMS 
residential un 160.99752MAE 
Operation of 
Power 2.72%MAPE 
System 
cloud-based 
appHcaon Sensors,cloudstorage 
ANN [22] Residential Hourly lyear for electricity cacti tae 1.12MAE 
. lustering 
consumption 
analysis 
Electricity 
demand of a 
bio-climatic | temperature of the 
ANN é : building and dry bulb, sun 
PCA [eel Resaennal Bowe), po one influence of | radiation, humidity MA 
temp and ___|ratio, and wind speed 
solar 
radiation 
0.0219 IMAE 
ANN [20] aoe Hourl 3 years yea N/A 1.26268e— 
PSO residential y y or 003MSE 
prediction 0.03388MAPE 
time factor, historical 
the exact : : 
: weather information, 
capacity to 
customer class, 
progues tie revious load that the 
ANN [43] Residential Hourly |One Week| power and previ 1.066%MAPE 
region demanded, 
the demand 
region growth, and 
from 
amount of load 
customers : 
increased 
Construction 
of component solar radiation, 
ANN [44] Residential Hourly | One day | load model |ambient temperature, N/A 
for house humidity 
hold 


3. Major Findings with Research Gaps 


Duration-based model performance analysis 

The majority of current models only anticipate loads 
for the short term, as opposed to long- or mid-term 
periods. [38-40] this could be due to the fact that a 
longer forecasting horizon greatly increases the 


likelihood that changes that are not yet known to us 
will have an impact on demand in the future. 


The influence of the temporal domain 

The prediction capabilities of many models, which 
rely on either residential or non-residential buildings, 
are inadequate. Achieving equilibrium between 
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residential and nonresidential models could be 
feasible if such a model could be created. [41] 
Model Flexibility Issues 
The models that have been studied during the 
research are intended for a certain use case, such as a 
particular sort of input parameter or geographic area. 
Throughout the investigation, no model has been 
discovered that can function with the best accuracy 
regardless of application changes. [42] 
Hybrid Models 
Offer greater accuracy than  single-method 
approaches in predicting load consumption. 
4. Future Trends 

4.1. Model Enhancement 
Research evaluation indicates that no model takes 
into account the customer class and concentrates on 
year-round peak hour prediction. So need to put 
further effort on clustering techniques. 

4.2. Cloud based Applications 
The greatest accuracy is reported for clustering 
approaches when real-time electricity consumption 
data is collected using a cloud-based application for 
electricity consumption analysis. However, 
additional work on real-time data is required in the 
future. 
Conclusion 
A thorough analysis of several load forecasting 
models covering a range of time periods and sectors 
has been provided in this paper. We went over the 
effectiveness and drawbacks of the conventional load 
forecasting techniques. Lastly, drawing attention to 
the constraints, patterns, and areas in need of further 
study will help the readers choose and assess the 
models in accordance with their needs and obtain 
guidelines for future work. The goal of future 
research will be to improve the drawbacks of the 
current load forecasting models. 
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